Do extremists impose the structure of social networks? 
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The structure and the properties of complex networks essentially depend on the way how nodes 
get connected to each other. We assume here that each node has a feature which attracts the others. 
We model the situation by assigning two numbers to each node, u and a, where ui indicates some 
property of the node and a the affinity towards that property. A node A is more likely to establish 
a connection with a node B if B has a high value of to and A has a high value of a. Simple computer 
simulations show that networks built according to this principle have a degree distribution with a 
power law tail, whose exponent is determined only by the nodes with the largest value of the affinity 
a (the "extremists"). This means that the extremists lead the formation process of the network and 
manage to shape the final topology of the system. The latter phenomenon may have implications 
in the study of social networks and in epidemiology. 

PACS numbers: 89.75.Hc, 05.10.-a 
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The study of complex networks is currently one of the 
hottest fields of modern physics [3, 13 ■ A network (or 
graph) is a set of items, called vertices or nodes, with 
connections between them, called edges. Nodes linked by 
an edge are neighbours and the number of neighbours of 
a node is called degree. Complex weblike structures de- 
scribe a wide variety of systems of high technological and 
intellectual importance. Examples are the Internet, the 
World Wide Web (WWW), social networks of acquain- 
tance or other connections between individuals, neural 
networks, food webs, citation networks and many others. 

One of the crucial questions concerns the formation of 
these structures. Complex networks are in general sys- 
tems in evolution, with new nodes/edges that get formed 
and old ones that get removed or destroyed. The cur- 
rently accepted mechanism finds its roots in an old idea 
of Price [J, based on the so-called preferential attach- 
ment, which means that a newly formed node A builds 
an edge with a preexisting node with a probability that is 
proportional to the degree of the latter node. Networks 
constructed in this way d, IE have a degree distribu- 
tion with a power law tail, as observed in real networks. 
This simple rule, however, makes implicitcly the strong 
assumption that each node is at any time informed about 
the degree of all other nodes, which is certainly not true, 
especially for gigantic systems which contain many mil- 
lions of nodes, like the WWW. We rather believe that 
the key behind the building of a connection between a 
pair of nodes lies essentially in the mutual interaction of 
the two nodes, (almost) independently of the rest of the 
system: two persons usually become friends because they 
like each other. 

In this letter we have social networks in mind, but nev- 
ertheless we will speak generally about networks, as we 
believe that our model has a more general validity. The 
mechanism we propose is that any node has some prop- 
erty (beauty, richness, power, etc.) by which the others 



are attracted. We indicate the property with a positive 
number u>, the attractiveness by another positive num- 
ber a. We assume that high values of u> correspond to 
a high degree of the property (the most beautiful peo- 
ple, for instance). Both ui and a are attributes of single 
nodes/individuals, so they take in general different val- 
ues for different nodes. What we need is a knowledge 
of the distribution of 10 and a in the network. For the 
property cj, distributions which vanish for high values 
of lj, like exponentials or power laws, are realistic. As 
far as the affinity a is concerned, it is less clear which 
distributions can be considered plausible, therefore we 
tested several functions. We remark that the idea that 
the nodes have individual appeal already exists in the lit- 
erature on complex networks. Bianconi and Barabasi Q 
assigned a parameter r\ called "fitness" to each node of 
the network and the linking probability becomes propor- 
tional to the product of the degree with the fitness of the 
target node. In the same framework, Ergiin and Rodgers 
[9j proposed a different ansatz for the linking probabil- 
ity, which in their case is proportional to the sum of the 
fitness and the degree of the target node. Both models 
however are based on preferential attachment. Caldarelli 
et al. [lfj proposed instead a variation of the fitness 
theme which eliminates preferential attachment, so that 
the formation principle of networks lies in the attraction 
which nodes exert on each other by virtue of their indi- 
vidual quality/importance, which is actually in the spirit 
of our work. So, in 10], the linking probability is simply 
a function of the fitness values of the pair of nodes, and 
several possible choices for this function are introduced 
and discussed. 

Our expression for the probability pab of a node A to 
build an edge with a node B is also a function of the 
individual attributes of the nodes, ui and a. We adopt 
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the ansatz 
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where cb is a normalization constant and (f>(u>) the dis- 
tribution function of the property to, whereas we will indi- 
cate with ip( a ) the distribution function of the affinity a. 
What Q says is that the pairing probability is inversely 
proportional to the relative frequency of the property to 
in the network. Thinking again of a social system, the 
idea is that there is a general tendency to be more at- 
tracted by those subjects who are characterized by high 
values of u>. In a network of sexual relationships, for in- 
stance, the best looking people usually have the greatest 
chances to be chosen as sexual partners. We believe that 
the choices of the people are not influenced by the ab- 
solute importance of lo, which is a vague and abstract 
concept, but rather by the perception of the importance 
of the property lo within the society, which is related to 
its distribution. This is why we associated the pairing 
probability to the relative frequency 4>(co) and not di- 
rectly tow, at variance with 10]. Accordingly, the larger 
to, the lower the occurrence <fi(u>) of that degree of the 
property in the network, and the edge-building proba- 
bility gets higher. On the other hand, for a given node 
B, characterized by its property lob, the other nodes will 
feel an attraction towards B which varies from a subject 
to another. This modulation of the individual attraction 
is expressed by the exponent a a in The probability 
Pab increases with a a, justifying the denomination of 
"affinity" we have given to the parameter a. The param- 
eter a must be taken in the range [0, 1] for normalization 
purposes 

The expression Q might look ad hoc, because of the 
power law dependence on a. This is not the case, due to 
the freedom we have in the choice of the a's: if we have 
an arbitrary probability pab for the node A to be linked 
to B, in most cases one is able to find a number ctA such 
that pab = cb /[<f>( UJ B)] aA , so to reproduce the wished 
probability. 

Our simple model is a generalization of the so-called 
"Cameo principle" which has recently been introduced 
by two of the authors 12]. There, the affinity a was 
the same for all nodes and there was consequently no 
correlation between pairs of nodes. In this case it was 
rigorously proven that the network has indeed a degree 
distribution with a power law tail and that the exponent 
7 is a simple function of a, more precisely 

• if 4>(uj) decreases as a power law with exponent (3 
when lo — > oo, 7 = 1 + 1/a — l/a(3; 

• if 4>(lu) vanishes faster than any power law when 
lo — > 00, 7 = 1 + 1/a. 

For our generalization we studied the problem numer- 
ically, by means of Monte Carlo simulations, but an an- 
alytical proof of the results we show here is in progress 



le+07 



le+06 



I 100000 



10000 



1000 



100 




1000 
Degree 



10000 



FIG. 1: Cumulative degree distribution for a network where 
4>{w) = e - ^ and ip(a) is constant in [0,0.7]. In the double 
logarithmic scale of the plot a power law tail would appear as 
a straight line, as in the figure. 



|13| . In order to build the network we pick up a node A 
and build m edges with the other nodes of the network, 
with probability given by (I). The procedure is then re- 
peated for all other nodes of the network. We remark 
that our construction process is static, i. e. all nodes of 
the network are there from the beginning of the process 
and neither nodes are added nor destroyed. However, 
the principle can as well be implemented in a dynamical 
way, with new nodes which are progressively added to 
the network 

We fixed the outdegree m to the same value for all 
nodes, as it is done in the famous model of Barabasi and 
Albert @ (we set m — 100). The number N of nodes 
was mostly 100000. 

We have always used a simple exponential for 4>(lo). 
Fig. ^ shows the cumulative degree distribution of the 
network constructed with a uniform affinity distribution 
ip(ct) = const., for a in the range [0, 0.7]. The cumulative 
distribution is the integral of the normal distribution. 
So, for a value k of the degree we counted how many 
nodes have degree larger than k. The summation reduces 
considerably fluctuations and the analysis gets easier. If 
the degree distribution is a power law with exponent 7, 
its integral will be again a power law but with exponent 

7 - I- _ 

In Fig. ^ we see that indeed the cumulative distribu- 
tion ends as a straight line in a double logarithmic plot, 
so it has a power law tail. We performed many trials, by 
varying the range of the uniform distribution ijj(a), and 
by using other kinds of distribution functions for a, like 
gaussians, exponentials and power laws. We found that 
the result holds in all cases we considered. 

Another striking feature of our findings is shown in 
Fig. [21 Here we plot the cumulative degree distribu- 
tions for two networks, where ip{a) is uniform and we 
chose the affinity ranges such that they share the same 
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FIG. 2: Cumulative degree distribution for two networks char- 
acterized by uniform distributions t[>(a). The upper limit of 
the a-range is the same in both cases. The power law tails 
have the same slope. 



upper limit a max ([0,0.8] and [0.3,0.8], respectively, so 
ct-max = 0.8). We see that the tails of the two curves 
have the same slope, which suggests that 7 only depends 
on a max . We repeated this experiment several times, for 
different ranges and taking as well exponential and gaus- 
sian distributions for a. Within errors, we confirmed this 
remarkable result. 
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FIG. 3: Dependence of the exponent 7 — 1 of the cumulative 
degree distribution on the upper limit a max of the affinity 
range. The data points can be fitted by the simple ansatz 
a/a max ■ 

Fig. |21 shows how the exponent 7 — 1 of the cumula- 
tive degree distribution varies with a ma x- The pattern 
of the data points follows an hyperbola a/a max , with a 
coefficient a = 1.29; this is very close to what one gets for 
the original Cameo principle 12], where 7 — 1 = 1/a. It 
is likely that in the limit of infinite nodes the coefficient 
would indeed converge to one. Since a max can be cho- 
sen arbitrarily close to zero, from the ansatz a/a ma x we 



deduce that, within our model, we are able to build net- 
works with any value of 7 greater than (about) 2. This is 
fine, as for the great majority of complex networks 7>2 
as well. 

So, we have discovered that the nodes with the highest 
affinity a, that we call "extremists" for obvious reasons, 
are responsible for the exponent 7 of the power law tail 
of the degree distribution of the network. This is valid 
independently of the distribution ip(a), so it works even 
in the case where the extremists are just a very small 
part of the population |14| . If we consider terrorism net- 
works, for example, the leaders of the group (those with 
highest charisma/a-i) are the hubs of the network, i.e. the 
most connected individuals, but their relative importance 
is determined by the most fanatic followers (those with 
largest a). We have then shown that there is a sort of 
time-dependent hierarchy among the nodes: the extrem- 
ists lead the formation process, the hubs dominate the 
structure once the network is built. 

We give here a hint to the analytical proof of this re- 
sult. Let us consider the simple case of a discrete distri- 
bution 'J? (a) of the form 

m 

*(a) = 5^A i <y(a-a i ) > (2) 

i=l 

with Xi > and YaLi A* = 1. So the fraction of nodes 
x with a(x) = (Xi is Xi > 0. The validity of the result lies 
in the fact that the global degree distribution is given by 
a superposition of the degree distributions associated to 
nodes with the same ou. Since each of those distribution 
has a power law tail [I3] , the overlap is dominated by the 
term having the fattest tail, i. e. the smallest exponent 
7, which corresponds to the maximum a max of the a's, 
due to the relation 7 = 1 + 1/a. In this way, any function 
can be considered as the limit of a sum like J5J), when the 
number of terms goes to infinity; for more details see p^ ] 
and [l|. 

We know that the exponent 7 is a crucial feature 
of complex networks in many respects. For epidemic 
spreading, for example, there is no non-zero epidemic 
threshold so long as 7<3, which would have catas- 
trophic consequences. If the network is in evolution, to 
control the extremists would mean to be able to exert an 
influence on the future topology of the network, which 
can be crucial in many circumstances. 

From a practical point of view, it is not obvious how to 
model things like attractiveness (or fitness), which usu- 
ally are out of the domain of quantitative scientific in- 
vestigations. However, our result on the leading role of 
the extremists is quite robust, as it does not depend on 
the specific function ip(a) that one decides to adopt. The 
attempt to mathematically modelize apparently abstract 
features of social systems (here the "attractiveness") is 
not isolated 0. The last few years witnessed a big effort 
to describe society as a physical system 0,0,0], with 
people playing the role of atoms or classical spins un- 
dergoing elementary interactions. There are meanwhile 
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several models to explain how hierarchies |20| and con- 
sensus j2]| may originate in a society and in these models 
abstract items like opinion, confidence, etc. are associ- 
ated to well-defined mathematical variables. Although 
one must always be careful not to demand too much 
from such models, the first results of this line of research 
are encouraging; with the consensus model of Sznajd |22^ 
one could reproduce the final distribution of votes among 
candidates in Brazilian and Indian elections |U 0] ■ 

In conclusion, we have introduced a simple criterion for 
the nodes of a complex network to choose each other as 
terminals of mutual connections: each node has a prop- 



erty u> which attracts the other nodes to an extent which 
depends on another individual parameter a. Networks 
built in this way are always characterized by a degree 
distribution with a power law tail. The exponent of the 
power law is determined uniquely by those nodes which 
are most sensible to the property u>. Acting on such 
nodes could be an effective way to control the structure 
of evolving networks. 
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